Function Approximation for Continuous Constrained MDPs
نویسندگان
چکیده
In this work we apply function approximation techniques to solve continuous, constrained Markov Decision Processes (MDPs). Many real-world robot planning problems are best represented as MDPs with a continuous state space. However, in many scenarios constraints must be accounted for as well. These constraints are treated probabilistically, with a bound on the constraint violation probability. Existing function approximation literature does not account for constraints, and existing constrained MDP work assumes a discrete state space. We seek to bridge this gap by maintaining two continuous value functions one for the reward and one for the constraint violation probability and approximate these value functions by writing them as a linear combination of a set of basis functions (not necessarily the same set for both value functions). We present some simulation results and investigate the impact of choice of basis functions on the quality of policies generated.
منابع مشابه
Solution Methods for Constrained Markov Decision Process with Continuous Probability Modulation
We propose solution methods for previouslyunsolved constrained MDPs in which actions can continuously modify the transition probabilities within some acceptable sets. While many methods have been proposed to solve regular MDPs with large state sets, there are few practical approaches for solving constrained MDPs with large action sets. In particular, we show that the continuous action sets can ...
متن کاملSymmetric Primal-Dual Approximate Linear Programming for Factored MDPs
A weakness of classical Markov decision processes is that they scale very poorly due to the flat state-space representation. Factored MDPs address this representational problem by exploiting problem structure to specify the transition and reward functions of an MDP in a compact manner. However, in general, solutions to factored MDPs do not retain the structure and compactness of the problem rep...
متن کاملBounded Approximate Symbolic Dynamic Programming for Hybrid MDPs
Recent advances in symbolic dynamic programming (SDP) combined with the extended algebraic decision diagram (XADD) data structure have provided exact solutions for mixed discrete and continuous (hybrid) MDPs with piecewise linear dynamics and continuous actions. Since XADD-based exact solutions may grow intractably large for many problems, we propose a bounded error compression technique for XA...
متن کاملContinuous Discrete Variable Optimization of Structures Using Approximation Methods
Optimum design of structures is achieved while the design variables are continuous and discrete. To reduce the computational work involved in the optimization process, all the functions that are expensive to evaluate, are approximated. To approximate these functions, a semi quadratic function is employed. Only the diagonal terms of the Hessian matrix are used and these elements are estimated fr...
متن کاملPiecewise Linear Value Function Approximation for Factored MDP
A number of proposals have been put forth in recent years for the solution of Markov decision processes (MDPs) whose state (and sometimes action) spaces are factored. One recent class of methods involves linear value function approximation, where the optimal value function is assumed to be a linear combination of some set of basis functions, with the aim of finding suitable weights. While sophi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011